COMPOSITION AND METHOD FOR OPTIMIZED HYBRIDIZATION 
USING MODIFIED SOLUTIONS 



FIELD OF THE INVENTION 

The present invention relates to the field of nucleic acids and more particularly to 
a modified solution containing a denaturant for application with biopolymers and micro 
arrays. 

BACKGROUND OF THE INVENTION 

Various arrays of polynucleotides (such as RNA and DNA) are known and used 
in genetic testing, screening and diagnostics. Arrays are defined by the regions of 
different biopolymers or nucleotides arranged in a predetermined configuration on a 
substrate. Most importantly, the arrays when exposed to a population of analytes will 
exhibit a pattern indicative of the presence of the various components separated spatially. 
Array binding patterns of polynucleotides and/or peptides can be detected by using a 
variety of suitable target labels. Once bound to the array, these target labels can then be 
quantified and observed and the overall pattern on the array determined. 

DNA micro arrays are particularly useful for analyzing large sets of genes through 
"gene expression profiling". However, for the micro arrays to be effective in binding 
target sequences, they must be capable of annealing. In addition, the detection of optimal 
"sensitivity" and "specificity" of hybridization for biopolymers (probes) to 
complimentary sequences in complex cRNA is complicated by the fact that the 
characteristic melting temperature (Tm) for this interaction rises with increasing probe 
length. However, increasing the stringency of hybridization comes with the cost of 
losing sensitivity. Moreover, the higher temperatures needed to reach the Tm (> 60 °C) 
are detrimental to the array surface. Previous researchers have determined that including 
30% formamide in the hybridization solution can significantly reduce the Tm and allow 
for hybridization to take place at a lower temperature while maintaining an acceptable 
balance between specificity and sensitivity. Formamide is often used in Southern and 
Northern blotting as a hybridization modifier, because it is effective and because it is 



easily washed out of the nitrocellulose or modified nylon materials used to perform 
porous filter-based hybridization assays. However, formamide is a highly toxic and 
hazardous to ship (the US Department of Transportation requires double-containment of 
formamide-containing solutions). For these reasons it would be desirable to create an 
5 effective solution system that maintains specificity, sensitivity and allows for 
hybridizations at lower temperature, yet is not toxic, does not effect the central nervous 
system of the operator, is not a fetal poison or does not require special handling or 
transportation costs. These and other problems with the prior art systems and solutions 
are obviated by the present invention. The references cited in this application infra and 
10 supra, are hereby incorporated by reference. However, cited references or art are not 
admitted to be prior art to this application. 

5 SUMMARY OF THE INVENTION 

0 

jti 15 The invention provides a composition, kit and method for use with micro arrays. 

fV'li 

The composition, kit and method allow a probe and target to hybridize at a temperature 
« lower than their standard hybridization temperature. The composition has a chemical 

U component of the formula: 

5 20 R(NH 2 )C=0 

where R is an amino or alkyl group. The kit of the present invention comprises a micro 
array, a composition for use with the micro array and a target for detection. The kit may 
further comprise an optional set of instructions. The composition in the kit contains the 
25 chemical component. The chemical component may or may not be in solution. The 
method provides the steps of adding to a probe and target the chemical component, 
heating the probe and target in the presence of the added component and then allowing 
the biopolymers to hybridize. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Embodiments of the invention will now be described with reference to the 
drawings in which: 

5 

FIG. 1 illustrates a single nucleotide polymorphic array. 

FIG. 2 is an enlarged view of a portion of FIG. 1 showing multiple spots or 
regions of an array. 

10 

FIG. 3 shows a duplex hybridization curve without the addition of the denaturant. 

H Percentage of single stranded DNA is plotted against temperature. 

O 

pf FIG. 4 shows a plot of percentage of single stranded DNA against temperature. 

ffl 15 The plot shows a series of melting curves (25 , 60 and 60 mer with denaturant added). 

GO 

FIG. 5 shows a contour plot of the observed optimal conditions using urea 
□ concentrations with varying probe lengths (L= 60, T = 50 °C). Probe length has been 

j t plotted against urea concentration. 

O 20 

FIG. 6 shows a plot of optimal urea concentrations of 50 °C probe length series. 
The plot shows average GCN4 specificity plotted against urea concentration. 

FIG. 7 shows the extrapolated optimal urea concentration for a probe length of 
25 60-mer at a temperature of 50 °C. Optimum urea concentration at 50 °C is plotted against 
probe length. 

FIG. 8 shows a contour plot of the extrapolation to predicted optimum for the 
probe having a length of 60 and 50 °C. Probe length has been plotted against urea 
30 concentration. 
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FIG. 9 shows the results of various arrays with the addition of the denaturant. 
DETAILED DESCRIPTION OF THE INVENTION 

Before describing the present invention in detail, it is to be understood that this 
invention is not limited to specific compositions, process steps, or equipment, as such 
may vary. It is also to be understood that the terminology used herein is for the purpose 
of describing particular embodiments only, and is not intended to be limiting. 

It must be noted that, as used in this specification and the appended claims, the 
singular forms "a", "an" and "the" include plural referents unless the context clearly 
dictates otherwise. Thus, for example, reference to "an array" includes more than one 
array, reference to "a polynucleotide primer" includes a plurality of polynucleotide 
primers and the like. 

In describing and claiming the present invention, the following terminology will 
be used in accordance with the definitions set out below. 

A "biopolymer" is a polymer of one or more types of repeating units. This 
includes traditional polynucleotides, the case in which the conventional backbone has 
been replaced with a non-naturally occurring or synthetic backbone, and nucleic acids in 
which one or more of the conventional bases have been replaced with a synthetic base 
capable of participating in Watson-Crick type hydrogen bonding interactions. 
Polynucleotides include single or multiple stranded configurations, where one or more of 
the strands may or may not be completely aligned with another. While probes and targets 
of the present invention will typically be single-stranded, this is not essential. 
Specifically, a "biopolymer" includes DNA (including cDNA), RNA and 
polynucleotides, regardless of the source. 

An "alkyl" group includes aliphatic and aromatic hydrocarbon molecules as well 
as substituted hydrocarbons and their derivatives. 

A "nucleotide" refers to a sub-unit of a nucleic acid and has a phosphate group, a 
5-carbon sugar and a nitrogen containing base, as well as analogs of such sub-units. An 
"oligonucleotide" generally refers to a nucleotide multimer of about 10 to 100 
nucleotides in length, while a "polynucleotide" includes a nucleotide multimer having 
any number of nucleotides. 



An "array" or "micro array", unless a contrary intention appears, includes any one 
or two dimensional arrangement(s) of addressable regions bearing particular biopolymer 
moieties (for example different polynucleotide sequences) associated with that region. An 
array is "addressable" in that it has multiple regions of different moieties (for example, 
5 different sequences) such that a region at a predetermined location (an "address") on the 
array (a "feature" of the array) will detect a particular target or class of targets (although a 
feature may incidentally detect non-targets of the feature). In the present case, the 
polynucleotide (or other) target will be in a mobile phase (typically fluid), while probes 
for the target ("probes") may or may not be mobile. "Hybridizing", "annealing" and 
10 "binding", with respect to polynucleotides, are used interchangeably. "Binding 
efficiency" refers to the productivity of a binding reaction, measured as either the 
absolute or relative yield of binding product formed under a given set of conditions in a 

□ given amount of time. "Hybridization efficiency" is a particular sub-class of binding 

rT efficiency, and refers to binding efficiency in the case where the binding components are 
CP 15 polynucleotides 

m The term ''Fluid" is used herein to reference to a liquid. 

The term "target" shall refer to a nucleic acid, nucleotide, nucleoside or their 

□ analogs. The term shall also include nucleotides having modified sugars as well as 
j S organic and inorganic leaving groups attached to the purine or pyrimidine rings. 

: ~ J 20 The term "probe" shall refer to a biopolymer such as a nucleic acid, nucleotide, 

nucleoside or their analogs. The term shall also include nucleotides having modified 
sugars as well as organic and inorganic leaving groups attached to the purine or 
pyrimidine rings. 

The term "specificity" shall refer to a ratio of specific to non-specific 
25 hybridization. 

The term "sensitivity" shall refer to signal intensity of a molecule that may or may 
not be attached to a micro array surface. 

The term "channel" shall refer to an area on an array that defines a particular type 
of feature of the array. 
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The term "stringent" or "stringency" shall refer to any condition or parameter 
imposed on a system or hybridization of probe and target that improves results (i.e. 
addition of a denaturant to increase hybridization efficiency). 

The term "standard hybridization temperature" shall refer to the temperature at 
which two oligonucleotides (i.e. a probe and target) anneal without the addition of any 
denaturant or other component(s). This is the hybridization temperature of the nucleic 
acids without any other components that effect the system thermodynamically. It is also 
related to a prescribed amount of heat and/or quantum of energy that is added to a closed 
system to produce a prescribed level of entropy and Watson-Crick base pairing. 

Referring first to FIGS. 1-3, typically kits and methods of the present invention 
use a contiguous substrate 1 carrying arrays 2 disposed across an array surface 3 of 
substrate 1 and separated by interfeature areas 4. The arrays on substrate 1 can be 
designed for testing an analyte or for evaluating probes on their ability to form hybrids. 
While a number of arrays 2 are displayed and shown in FIG. 1, the different 
embodiments described below may use substrates with particular numbers of arrays, it 
will be understood that substrate 1 and the embodiments to be used with it may use any 
number of desired arrays 2. For instance, substrate 1 may carry at least one, two, four, or 
at least ten arrays 2. Depending upon the intended use, any or all the arrays 2 may be the 
same or different from one another and each may contain multiple spots or features 6 of 
biopolymers. A typical array may contain more than ten, more than one hundred, more 
than one thousand or ten thousand features, or even more than one hundred thousand 
features in an area of less than 20 cm 2 or even less than 10 cm 2 . For example, features 
may have widths (that is, diameter, for a round spot) in the range from a 10 ^im to 1.0 cm. 
In other embodiments each feature 6 may have a width in the range of 1 .0 jxm to 1 .0 mm, 
usually 5.0 pm to 500 |im, and more usually 10 (xm to 200 |xm. Non-round features may 
have area ranges equivalent to that of circular features with the foregoing width 
(diameter) ranges. 

Some or all of the features 6 may be of different compositions. As mentioned, 
interfeature areas 4 will typically (but not essentially) be present which do not carry any 
polynucleotide (or other biopolymer of a type of which features 6 are composed). 
Interfeature areas 4 typically will be present where arrays 2 are formed by the 



conventional in situ process or by deposition of previously obtained moieties, as 
described above, by depositing for each feature at least one droplet of reagent such as 
from a pulse jet (for example, an inkjet type head). It will be appreciated though, that the 
interfeature areas 4, when present, could be of various sizes and configurations. It will 
also be appreciated that there need not be any space separating arrays 2 from one another. 
Each feature 6 carries a predetermined polynucleotide (which includes the possibility of 
mixtures of polynucleotides). As per usual, A, C, G, T represent the usual nucleotides. It 
will be understood that there is usually a linker molecule (not shown) of any known types 
between the front surface and the first nucleotide. 

FIGS. 1-2 illustrate an ideal array where actual features 6 are the same as the 
target (or desired) features, with each feature 6 being uniform in shape, size and 
composition, and the features 6 being regularly spaced. Such an array when fabricated by 
drop deposition methods, would require all reagent droplets for each feature to be . 
uniform in shape and accurately deposited at the target feature location. In practice, such 
an ideal result is difficult to obtain due to fixed and random errors during fabrication. 
Substrate 1, may have a thickness of less than 1 cm, or even less than 5 mm, 2 mm, 1 
mm, or in some embodiments even less than 0.5 mm or 0.2 mm. In the situation where 
the array is read by detecting fluorescence, substrate 1 may be of a material that emits 
low fluorescence upon illumination with the excitation light. The figures and description 
should not be interpreted as limiting the scope of the invention and are for illustrative 
purposes only. The invention has application to variety of micro array designs that allow 
biopolymers to hybridize. Similarly, substrate 1 may be of any shape, and any apparatus 
used with it adapted accordingly. 

It should be noted that an important part of the arrays 2 is the ability of the 
attached polynucleotides to hybridize to a known or unknown target 7 (not shown in 
FIGS.). Hybridization occurs by annealing of complementary strands. The kinetics of 
hybridization are well known in the art and studied. For instance, the kinetics of 
hybridization for an immobilized target can be defined by: 



kl[Cf][Cs] 



where kl is a constant, [Cf] is the concentration of the bound target sequence, a constant, 
5 [Cs] is the concentration of oligonucleotide in solution. However, a number of 

experiments have shown that hybridization depends on two processes; diffusion of the 
probe to the site and hybridization at the site. Therefore, the above equation can be 
written and more accurately defined by: 

10 kl[Cfj[Cs]+J 

where J is the diffusion coefficient of the probe. From this equation it follows that when 
the [Cf] is low, the hybridization is limiting and when the concentration of [Cf] is high 
the diffusion is limiting. 

15 Referring now to FIG. 3, a standard annealing curve is shown. The figure shows a 

duplex denaturation curve in which percentage of single stranded DNA is plotted against 
temperature. No denaturant has been added to the DNA. Complementary 
oligonucleotides are present in solution. The plot shows that as the temperature is 
increased the probability of annealing is increased. This is probably due to the increased 

20 energy that is added to the oligonucleotide chains in the form of heat. The chains begin to 
rotate and move in solution increasing the probability of complementary strands and 
bases to pair and hybridize to form more stable structures. The hybrization temperature is 
marked by the vertical line in the graph where the stability of the duplex formed upon 
hybridization can be expressed as a function of the temperature in which 50% of the 

25 molecules are hybridized (Tm). 

A molecule such as urea can be used in annealing or hybridizations reactions to 
lower the Tm. This is particularly effective in annealing DNA, RNA as a target and 
especially with dsRNA hybrids, as they have high Tms that necessitate elevated reaction 
temperatures. In a perfectly matched hybrid, the Tm may depend on a variety of 

30 conditions. Some conditions include ionic strength, the concentration and presence or 
absence of monovalent cations, base composition of the oligonucleotides, the size of the 
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oligonucleotides, the annealing properties including the amount of mismatches present 
and the presence of any denaturing agents. For instance, the Tm increases 16.6 degrees 
Celsius for a one fold increase in monovalent cations between 0.01 and 0.40 M NaCl. 
Also, for 25 mers, single base deletions show a decrease in Tm by 10 degrees Celsius. 
5 For every mismatch the Tm is lowered by 5 degrees Celsius. Denaturing agents such as 
urea reduce the Tm of a duplex in solution by approximately 30 degrees Celsius. 

FIG. 4 shows a series of melting curves with the addition of urea to 
complementary strands of DNA in solution. The figure shows a plot of % single stranded 
DNA vs. temperature. Melting curves are shown for 25 mer, 60 mer, and 60 mer with 
10 denaturant. The solid lines in the diagram indicate the hybridization temperatures after 
the denaturant urea is added to the sample. It should be noted that the temperature for 
optimal duplex formation is generally 5-10 degrees below the Tm. Maximum rates of 
hybridization are achieved at temperatures 20-25 degrees Celsius below the Tm (the 

LJ 

□ temperature at which the strands of a double-stranded probe are half dissociated) 
J 15 (MARMUR, J.G. and DOTY, P.J., Mol. Biol., 3, pp. 584, 1961). Hybridization 

temperatures at the probe Tm proceed very slowly. For longer probes, hybridization 
should be carried out at 65-68 degrees Celsius. However, this temperature is detrimental 
S to the micro array surface. Buffer modifiers such as urea significantly reduce the 
fjf effective Tm for duplex DNA by 15-30 degrees Celsius, enabling annealing to single 
Oi 20 stranded probes on the micro array surface to occur. This is particularly important for 

micro array surfaces, where high temperatures are detrimental to surface stability. A Tm 
modifier , for example, 50% formamide, is capable of reducing the Tm of a nucleic acid 
hybrid so that hybridizations can be carried out at 35-42 degrees Celsius. The advantages 
of using formamide-based hybridization buffers for nylon membrane hybridizations is 
25 that probes are more stable at the lower temperatures and there is better retention of non- 
covalently bound nucleic acids on the membrane. Formamide has no effect on 
hybridization rate when used in the range of 30-50% (Howley, P.M., et al, J. Biol. Chem., 
254, pp. 4876, 1979). The figure clearly shows that as the denaturant is added to the 
oligonucleotides in solution, the hybridization temperature is dropped below the original 
30 melting temperature. This is ideal for use in micro arrays, because less energy is then 
needed to heat the system to get maximum hybridization. Secondly, since lower heat is 
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applied and the temperature is lowered there are less denaturing effects on the actual 
oligonucleotide strands. Also, although the Tm of the 60 mer is higher than the 25 mer, 
the addition of denaturant still lowers the Tm and hybridization temperature. 

The invention provides a solution to be used with the arrays 2. The solution 
contains a chemical moiety having the formula: 

R(NH 2 )C=0 

where R is NH 2 or CH 3 . In one embodiment of the invention R is an amino group (the 
resulting compound is urea). In a second embodiment R may be an alkyl group (if 
R=CH 3 , the resulting compound is acetamide). The case where R is a hydrogen atom is 
excluded (when R=H, the resulting compound is formamide, and the toxicological 
properties of the compound are very different). Suitable ligands may also include alkyl, 
substituted alkyl, aryl, substituted aryl, aralkyl, substituted aralkyl, and their derivatives. 

Each of the above embodiments of the invention may be used with a solution that 
may be added to arrays 2 or other process or apparatus that allows for the hybridization of 
a nucleotide, biopolymer, or similar type component in a reaction solution or mixture. 

Denaturant Buffer Optimization 

Tests were conducted in order to achieve hybridization optimization with various 
denaturants. In addition, although short (20-25 base) oligonucleotides should 
theoretically provide the greatest discrimination between related sequences, observation 
by others have shown that short oligonucleotide lengths provide poor hybridization 
properties (Guo et al., 1994; Shchepinov et al., 1997). For these reasons, studies were 
conducted to maximize hybridization efficiency at longer lengths such as the 60-mer. 
These longer oligonucleotides or biopolymers have the advantage of more closely 
resembling the solution state with enhanced hybridization properties (Guo et al., 1994; 
Lockhart et al., 1996; Shchepinov et al., 1997). 

In order to achieve optimal sensitivity and specificity of hybridization of 60-mer 
oligonucleotide arrays to complex target cRNA, a reduction of the Tm of the 
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oligonucleotide and target is required. In other words, the length of the oligonucleotide 
requires added amounts of heat or energy to obtain hybridizations. However, this added 
energy or the level required for complete hybridization often requires levels of heat or 
energy that can simultaneously destroy the same biopolymers that are desired to 
hybridize. 

Denaturant Study Structure 

Grids experiments were designed using a 3x3x2x2 construction optimized for 
hybridization on the metric of "Mean Specificity" (mean specificity is defined in the 
equation that follows) as a function of probe length (20 to 60-mer). Tests were conducted 
using 3, 4, 4.92 M urea at various temperatures including 40, 45, and 50 °C. Results were 
duplicated and dye swapped pairs were used to control for any Cy3 vs. Cy5 channel bias 
(defined as + or - polarity). Mean specificity is given as the average over all the probes of 
a given length to a given gene (GCN4=Yel009C). 

Samples and Labeling 

Samples were labeled with fluorescent dyes (Cy3 and Cy5) for hybridization 
studies. The specific target used included a pure labeled GCN4 antisense RNA. Cross- 
hybridization targets included an amplified (polyA+- specific) labeled GCN4 /GCN4 
yeast knockout (KO) total RNA. In each experiment the (+) polarity was the Cy3 labeled 
specific target and the Cy5 labeled KO target. The (-) polarity was the Cy5 labeled 
specific target and the Cy3 labeled KO target. Data from each of the polarities was 
combined in the dye swap pairs. Dye swaps were performed to eliminate experimental 
and channel bias. 

Specific Signal vs. Probe Position (GCN4) 

Specific signal and probe position are highly correlated. Previously it has been 
shown that hybridization efficiency is dependent on probe position. Probes designed 
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closer to the 3' end of the corresponding gene give higher signal intensities due to the 
nature of the enzyme transcriptase activity. In addition, signal strength increases as probe 
length increases and frequency of good probes increases as probe length increases. These 
factors were important to consider in the study. 

5 

Cross-hybridization Problem 



Cross-hybridization is a problem in hybridization experiments. Increased 
hybridization stringency decreases cross-hybridization problems in annealing 
10 experiments. Therefore, various parameters must be tested in order to arrive at maximum 
sensitivity and specificity with a denaturant. In other words, we want the ideal conditions 
Z and concentrations to produce effective hybridizations below the Tm of a biopolymer and 

O target. The problem with such experiments being that specificity (most effective correct 

o 

M, binding) to sensitivity (maximum signal detection) are not easily achieved. In other 

'tl 15 words, optimal specificity may be achieved at a different stringency than optimal 

51 sensitivity. For formamide, optimal specificity has been observed for at a formamide 

Ijl concentration of approximately 4x that for optimal sensitivity. The KO mutant protocol 

y was employed for comparisons to pure cross-hybridization signals. An optimization 

; y 

If! metric was employed to quantify mean specificity. The mean specificity can be given by 

y- 20 the equation: 



+ Polarity - Polarity 

max(Ssig, Smin) g ,i + max(Ssig, Smin) r>i 

25 Specificity = . 

max(Xsig, Smin) g ,i + max(Xsig, Smin) rji 

Where, Ssig is SPECIFIC SIGNAL 

Xsig is CROSS HYB SIGNAL 

30 Smin = 2.6 G Background 

N 

MeanSpecificity = 1/N 2 Specificity 

i=l 

35 
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Observed Optima for Various Probe Lengths 
and Temperatures (Urea as a denaturant) 



FIG. 5 shows a contour plot of the observed optimal conditions using urea 
5 concentrations with varying probe lengths. Urea concentrations from 2.5 to 5.5 M are 
plotted on the x-axis and probe lengths were tested from 10 to 70-mers using similar 
targets. The optimum ridge shows the optimal conditions for hybridization with the 
addition of the denaturant at 50 °C. The ideal is a denaturant concentration that provides 
the greatest effect on the varying probe lengths when the temperature is held constant. 
10 The ridge represents this area on the contour plot. No optimal urea concentrations were 
bracket (area that is encircled in diagram) for longer probes at 40 or 45 °C. The optimal 
urea concentrations bracketed were at 50 °C for a probe length (L= 30, 35, 40 45, 50 and 
55). No optimal urea concentration was bracketed at 50 °C for L = 60. However, the 
results were extrapolated using the optima for other lengths. 
15 FIG. 6 shows the optimal urea concentrations of 50 °C probe length series. The 

S| plot shows average GCN4 specificity plotted against urea concentration. Each of the 
f s curves is concave down with a maxima around 4.0 M urea concentration. The curves in 

0 each case are parabolic and show predictable trends. 

in FIG. 7 shows the extrapolated optimal urea concentration for a probe length of 

H 20 60-mer at a temperature of 50 °C. Optimum urea concentration at 50 °C is plotted against 

probe length. Probe length is graphed from 25 to 65-mer. For the 60-mer the optimal urea 

concentration is predicted to be between 5.1 and 5.3 M. 

FIG. 8 shows a contour plot of the extrapolation to predicted optimum for the 

probe having a length of 60. Probe length has been plotted against urea concentration. 
25 The ridge in the diagram shows the optimum predicted for the denaturant on the 

prescribed conditions. The diagram shows that the optima for these experiments are very 

broad. Therefore, denaturants added to a probe and target can be very effective in 

lowering Tm for improved hybridizations. Maximum urea concentration being 

approximately 4.92 M. 

30 FIG. 9 shows the results of the Cy3 and Cy5 labeled micro arrays. The diagram 

shows the effects of oligonucleotide length and hybridization stringency on hybridization 
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to yeast genes. Three yeast genes are shown: YDR345C (HXTT3), YER0109W (ISC1) 
and (GCN4) are tiled on a DNA oligonucleotide micro array. The oligonucleotide probes 
corresponding to each of the three yeast cDNAs are of increasing length, 20, 25, 30, 35, 
40, 45, 50, 55, 60-mer, and have been tilted along the gene, spaced three oligonucleotides 
5 apart and arrayed in a 75 rows x 195 column format. Reverse gray images reveal 
hybridization of oligonucleotides probes in 4.92 M urea at 40 °C. Cy4 labeled Yeast 
GCN4 KO cRNA (2 jug), highlites HXT3 probes on the array. Whereas PCR amplified 
Cy3 labeled full length GCN4 DNA (0.5 ng) highlites GCN4 probes. Probes hybridizing 
equally to Cy5 and Cy3 are labeled yellow. ISO, in the middle of the array, is a no- 
10 expressing gene. The diagrams clearly show that hybridization takes place at lower 

temperature with the added denaturant. Similar hybridizations can be seen in the low and 

O high temperature micro arrays. However, the higher temperatures show slight problems 
with delamination, while the lower temperature graphs show an absence of such 

H problems. For an example of the solution(s) used in the experiments see Tables 1-3. 

Sj 15 Table 1 is the general buffer system. Table 2 shows a solution to be employed in the 

m Table 1 preparation. 

H The invention also provides a kit for using the composition. The kit provides for 

hs hybridizing biopolymers at temperatures below their standard hybridization temperatures, 
y The kit comprises a micro array; a composition for use with the micro arrays, a target for 
^ 20 detection and an optional set of instructions. The optional set of instructions may describe 
how to use the denaturant to obtain most effective binding hybridization temperatures. 
The micro array may be constructed of any number of probe lengths. The target may be a 
known or unknown biopolymer. The composition includes the chemical component 
R(NH 2 )C=0 where R is an amino or an alkyl group. The chemical component may be 
25 used alone or in solution. 

Having described the composition and kit of the present invention, a brief 
description of the method is now in order. The method includes the application or use of 
the same chemical composition. The composition may or may not be in a buffered 
solution. The method allows a probe to hybridize to a target at a temperature lower than 
30 their standard hybridization temperature. The steps of the method comprise adding to the 
probe and target a chemical component of the formula: R(NH 2 )C=0 where R is an 
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amino or an alkyl group, heating the probe and target in the presence of the added 
component, and then allowing the probe and target to hybridize. 



EXAMPLE 1 

5 Micro arrays 

Spotted cDNA arrays. Robotically deposited arrays of PCR products, each 
corresponding to a single yeast ORF, were as previously described ( Marton, M J. et al., 
1998). 

10 Oligonucleotide arrays. Surface-bound oligonucleotides were synthesized 

essentially as proposed (Blanchard et al., 1996). Hydrophobic glass wafers (75 x75 mm) 
containing exposed hydroxyl groups for nucleotide coupling were prepared using a self 
0 assembling silane mixture to form a uniform, hydrophobic surface with a controlled 
£7 density of hydroxyl groups extending by a single methylene unit above unfunctionalized 
2 15 silane components (S.M. Lefkowitz et al, unpublished data). Phosphoramidite monomers 
fij in propylene carbonate were delivered to defined positions on the glass surfaces using 
commercial ink-jet printer heads. Synthesis cycles were otherwise similar to traditional 
O oligonucleotide synthesis, with the exception that capping was omitted. After 
in manufacture, arrays were diced to 25 x 75 mm to accommodate commercial scanners. 
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EXAMPLE 2 

Oligonucleotide selection for micro arrays 



Saccharomyces cerevisiae oligonucleotide set design were constructed as 
25 described below. Sequences were chosen from the full genome sequence (obtained from 
http://genome-www.stanford.edu/Saccharomyces) . All 60-mer sequences with the 3' kb 
of nuclear encoded ORF sequences were evaluated. First, low-complexity sequence was 
masked using RepeatMasker (University of Washington and Genetic Information 
Research Institute). Second probes with 35-45% (G+C) were taken. Among these, a set of 
30 oligonucleotides overlapping by < 40 bases was derived and analyzed by BLAST 

(http://www.ncbi.nlm.nih.gov/BLAST/) to identify those with lowest similarity to other 
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source sequences. The eight remaining probes with the highest predicted specificity were 
selected to represent the source sequence. In some cases, eight oligonucleotides meeting 
these criteria were not identified, in which case selection stringencies were lowered 
and/or fewer than eight oligonucleotides were printed. 
5 Human oligonucleotide sets were designed as described below. A total of 49,218 

Homo sapiens sequences were picked from the longest messenger RNA (mRNA) 
sequences representing UniGene clusters (Release 111, 15 April, 1999) 
(http://www/ncbi.nlm.nih.govAJniGene/) . For each sequence, every 60-mer between 50 
and 350 bases from the 3' end of the sequence was evaluated. First, oligonucleotides 

10 were filtered to remove with any repeat elements. Simple repeats or repeat elements were 
detected using Repeat Masker. Oligonucleotides with homopolymeric stretches of more 
than six bases were also rejected. Second, oligonucleotides were filtered to be <50% G 
and 25-70% (G+C). Among those remaining, a set of oligonucleotides overlapping. by 
<30 bases was derived and analyzed by BLAST to identify those with lowest similarity to 

15 all other source sequences. The surviving oligonucleotide with the highest predicted 

specificity was selected to represent its source sequence. The 23,965 oligonucleotides on 
the array used are those among 49,218 with the highest signal intensity when hybridized 
with cRNA prepared from Jurkat cells. 

Mutations and deletions were performed as described below. Mismatch bases 

20 were "mutated" at random, for example, a mutated guanine had an equal possibility of 
mutation to each of the other possible bases (adenine, cytosine, or thymidine). In the case 
of deletions, additional sequence adjacent on the synthetic mRNA was added so that a 
total of 60 complementary bases was retained. For each number of randomly placed 
single-base mismatches or deletions, 110 different oligonucleotides were designed. For 

25 sequential mismatches, each position was represented by a single oligonucleotide 
sequence printed on five different spots. 
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EXAMPLE 3 

Cell culture 

Saccharomyces cerevisiae cultures were grown as described (Marton et. al., 
1998). The gcn4A/gcn4A strain is R1792; the hxt3A/hxt3A strain is R5918.Strains that are 
often used and profiled are prototrophic diploid (i.e. R1165). NB4 cells (human 
promyelocyte leukemia) were obtained from the American Type Culture Collection 
(Rockville, MD). Human cell lines were grown in RPMI medium supplemented with 
10% fetal bovine serum in an atmosphere of 5% C0 2 . 

EXAMPLE 4 

RNA isolation 

Yeast total RNA was isolated by phenohchloroform extraction and poly-A 
purified as described (Marton et al., 1998). Human total RNA was isolated using TRLzol 
(Life Technologies, Rockville, MD) or Rneasy (Qiagen, Valencia, CA). 

EXAMPLE 5 

Synthetic cRNA and mRNAs 

Full length S. cerevisiae ORFs GCN4 and HXT3 were amplified by PCR from a 
genomic DNA template and ORF specific primers. The T7 RNA polymerase (T7RNAP) 
promoter sequence was introduced into the antisense primer for each ORF. ORF specific 
cRNAs were then produced from the DNA templates by in vitro transcription using a kit 
well know in the art (MegaScript, Ambion, Austin Tx). After labeling (as described 
below), these synthetic cRNAs were used directly for micro array hybridization. The 
array was hybridized with a mixture of 2 fig Cy3-labeled cRNA from a GCN-4 deficient 
strain of S. cerevisiae and 0.5 ng Cy5-labeled GCN4 ORF-specific IVT product (cRNA) 
(~ 4.5 copies per cell of the 843 nt GCN4, ORF, assuming ~ 15,000 transcripts per S. 
cerevisiae and an average transcript length of 1 kb). For spike-in RNAs, a portion of the 
adenovirus type 5 Ela (nucleotides 560-972) was amplified by PCR from a full-length 
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plasmid clone of the gene and subcloned into the pS64Po!y(A) Vector (Promega, 
Madison, WI). Random 60-mer oligonucleotides were then cloned into Xbai/BamHI sites 
of this subclone, adjacent the poly- A sequence. Individual clones were isolated, and the 
sequences of the inserts determined. To generated synthetic mRNAs, clones were 
linearized with EcoRI and an SP6 IVT reaction was then performed. This reaction was 
then followed by DNase treatment of the product. Synthetic mRNAs were purified on 
Rneasy columns (Qiagen). 

EXAMPLE 6 

MRNA reverse transcription and amplification 

In vitro transcription (IVT) was performed as modified (DeRisi, et aL, 1997). 
Total RNA was used as input for cRNA synthesis. An oligo-dT primer containing a T7 
RN A polymerase promoter sequence was used to prime first strand cDNA synthesis, and 
random hexamer were used to prime second stranded cDNA synthesis by Maloney 
murine leukemia virus (MMLV)RT. This reaction yielded a double stranded cDNA that 
contained the T7 RNA polymerase promoter at the 3' end. The double stranded cDNA 
was then transcribed into cRNA by T7RNAP. 

PCR-coupled IVT (PCR-IVT) was performed as described below. 3'end cDNA 
was synthesized by an adaptation of the protocol of Zhao et al. (Zhao et aL, 1998). To 
prevent transcript detection biases due to unequal amplification of certain sequences 
during PCR, the amount of input RNA was increased to 2.4-4.0 fig and decreased the 
number of PCR cycles to 10. To allow further sequence amplification by cRNA 
synthesis, a T7RNAP promoter sequence was added to the 3 '-end primer sequence used 
during PCR. Following PCR, amplified DNA was isolated by phenol/chloroform 
extraction and used as a template in an IVT reaction (MegaScript Ambion). 
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EXAMPLE 7 

Sample labeling, hybridization, 
scanning and image analysis 

5 Different cDNA and cRNA were labeled with Cy3 and Cy5 dyes using a two step 

process. First, allylamine-derivitized nucleotides were enzymatically incorporated into 
cDNA or cRNA products. For cDNA labeling, a 1 : 1 mixture of 5-(3-aminoallyl) 
thymidine 5 '-triphosphate (Sigma) and thymidine triphosphate (TTP) was used in place 
of TTP during cDNA synthesis. For cRNA labeling, a 1:3 mixture of 5-(3- 
10 aminoallyl)uridine5'-triphosphate (UTP) was substituted for UTP in the IVT reaction. 
Allylamine derivatized cRNA or c DNA products were purified using commercial 
columns. Columns were washed 3x with 80% ethanol and eluted with water. Purified 
o products were reacted with N-hydroxy succinimide esters of Cy3 or Cy5, following the 
jSi manufacturer's instructions. Dye molecules were separated from labeled products using 
15 standard commercial columns. Cy3 labeled cRNA or cDNA from "control" cells were 

g} mixed with the same amount of Cy5-labeled product from "experiment" cells (-30 fig for 

ffl 

cDNA or 5 fig for cRNA per human sample per hybridization, unless otherwise noted). 
; All hybridizations were done in duplicate and fluor reversal, to compensate for biases 
jy cause by differing chemical properties of the fluorescent dye molecules as well as for 

20 biases associated with normalization. Before hybridization, labeled cRNAs were 
H fragmented to an average size of -50-100 nt before incubation at 60, 50, 45, or 40 

degrees Celsius for 30 minutes in the presence of 10 mM ZnCl 2 , cDNAs or fragmented 
cRNAs were added to hybridization buffer as described below. Hybridizations were 
carried out in a final volume of 3 ml in a flexible plastic enclosure at 40 degrees Celsius 
25 on a rotating platform in a hybridization oven (Robbins Scientific, Sunnyvale, CA). After 
hybridization for 16-24 hours, slides were washed (rocking -30 s in 6x SSPE, 0.005% 
sarcosine, then rocking 30 s in 0.06 x SSPE) and scanned using a confocal laser scanner 
(General Scanning, Wilmington, MA; Genetic Microsystems, Woburn, MA; or Axon, 
Foster City, CA). Fluorescence intensities on scanned images were quantified, corrected 
30 for background and normalized (Marton et al., 1998). 
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EXAMPLE 8 

The solution or reaction mixture 



Table 1-3 shows a description and breakdown of the components used in a 
5 reaction solution. The components generally include nuclease free water, 5 M MCI 
(M=Na + , Li + , or other suitable monovalent cation), 1.0 M M-MES, pH = 6.4, and 10% 
w/v Sarcosine (sigma L-9150). The volumes for each of the components are shown in the 
tables for the case where M=Na + . Solutions may be altered and components changed. The 
guiding design principle is that the addition of the urea, amide, or an amino component 
10 allows for hybridization of the polynucleotide or biopolymer on the micro array surface 
at lower temperature than the standard hybridization temperature without the addition of 
the denaturant. The optimization of the chosen temperature and the composition of the 
hybridization buffer represents a balancing of the stabilization of perfect Watson-Crick 
nucleic acid duplexes and the destabilization of imperfect duplexes, i.e. the balance 
15 between sensitivity and specificity. [NOTE: optimal hybridization is usually achieved at a 
temperature about 5°C below the melting temperature of the desired duplex. Melting 
temperature is effected by monovalent cations (usually Na + or Li + ) concentration, probe 
;1T length, probe sequence, modifier concentration and target concentration. The affect of 
[U the modifier (urea, acetamide, etc.) is to lower the melting temperature, all other factors 
□ 20 being equal, by destabilizing the hydrogen bonding of Watson-Crick base pairs. -pKw]. 

EXAMPLE 9 

25 Effects of Various Molarities on 

Oligonucleotide Hybridization Temperature 

FIGS. 5-9 show the effects of various urea molarities on the polynucleotide or 
biopolymer hybridization temperature. As can be seen in the figures, higher amounts of 
30 urea are effective in lowering hybridization temperatures. Note that as the probe length 
increases, there is also a higher required molarity of urea to obtain the maximum or most 
effective hybridization temperature. 
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For further questions regarding any of the procedures in the described examples 
see (Hughes et al., 2001). 



5 



10 



□ 
O 

o 

m 
m 



15 



O 

ry 

ui 20 

O 



25 



30 



21 



Table 1 



Hybridization Title: 


6CN4 Experiment Protocol 


Dye swap Polarity 


(-) 






Array Design File 


5cerevisiae_hxt3_yer019w_gcn4_20-60mers.txt 




Total Number of slides/T emp/Treat 


2 






Final Volume per slide: 


3000,0 fj\ 






RNA FRAGMENTATION STEP 












4.92M Urea 


Final [Component] 




[Component] 


Volume Amount 


nuclease-f ree water 


100.0% 


110.0 iJi 


3.7% 


GCN4 spike (S.Cerevisiae) Cy5 


0.10 pg/ml 


10.20000 0.5 ng 


0 00017 m/ml 




339.30 /jg/ml 


11.85 ^ 2 \XQ 


0.67 ug /ml 


Random 25-mer 


10.00 mg/ml 


60.00 d 


0.10 mg/ml 


25 mM Zn(OAc) 2 , 975 mAA Tris, pH 7.6 


25 x 


8.00 //I 


1.0 x 


Fragmentation volume: 


100.00 fi 


200.0 iA 




0.5M EDTA POST-f ragmentation stop volume 


5.00 //I 


10.00 




Total Fragmented Volume: 




210.00 yl 




HYBRIDIZATION STEP 








Fragmented sample volume per tube: 




210.0 iA 




Urea Volume Concentrate(8M Urea) 




3690.0 ^ 


4.92M Urea 


20/7X Hyb Buffer Concentrate 




2100.0 //I 


IX 


Water 




0.0 iA 




Total 




6000.0 //I 





22 



Table 2 





Desired Volume (ml): 


25 










20/7x (2.86x) Urea Hyb. Buffer 


















2.86x 
Concentrate 


Pinal 


- i: 

nl 


Component 


[Component] Volume (ml) 


[Component]/2 . 86x 
(in hyb. chamber) 




nuclease-f ree water 


100.0% 


2.71 


10.9% 


3.8% 


w 


5M NaCI (nuclease-f ree) 


5.0 M 


14.29 


2.9 M 


1.0 M 




1.0M Na-MES, pH 6.4 


1000.0 mM 


3.57 


142.9 mM 


50.0 mM 




0.5 M EDTA (pH 8.0) 


500.0 mM 


0.86 


17.1 mM 


6.0 mM 




10% w/v Sarcosine (Sigma L-9150) 


10.0% 


3.57 


1.4% 


0.5% 


; .. : . 
5 


Total 




25.0 ml 







Table 3 



Desired Volume (ml): 


10 








2X QC2-3 Hyb. Buffer 








Final 




[Component] 


Volume 


2X 


[Component]/2X 


Component 


(ml) 


Concentrate 


(in hyb. 
chamber) 


nuclease-f ree water 


100.0% 


0.23 


2.3% 


1.15% 


8M LiCl (nuclease-free) 


8.0 M 


1.53 


1.224 M 


0.612 M 


0.5M Li-MES, pH 6.1 


500.0 mM 


6.00 


300.0 mM 


150.0 mM 


0.5 M EDTA (pH 8.0) 


500.0 mAA 


0.24 


12.0 mM 


6.0 mM 


15% w/v Lithium Dodecyl Sulfate, 10% Triton X-100 


15.0% 


2.00 


3.0% 


1.5% 


Total 




10.0 ml 
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